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Abstract 

^0 ' In this paper, we investigate condition numbers of eigenvalue problems of 

^ c"| matrix polynomials with nonsingular leading coefficients, generalizing classical 

results of matrix perturbation theory. We provide a relation between the con- 
dition numbers of eigenvalues and the pseudospectral growth rate. We obtain 
that if a simple eigenvalue of a matrix polynomial is ill-conditioned in some re- 
spects, then it is close to be multiple, and we construct an upper bound for this 
distance (measured in the cuclidean norm). We also derive a new expression for 
the condition number of a simple eigenvalue, which does not involve eigenvectors. 
Moreover, an Eisner-like perturbation bound for matrix polynomials is presented. 
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1 Introduction 

The notions of condition numbers of eigenproblems and eigenvalues quantify the sen- 
sitivity of eigenvalue problems [S^finitrilliaiiailSlSBl^ They 
are widely appreciated tools for investigating the behavior under perturbations of 
matrix-based dynamical systems and of algorithms in numerical linear algebra. An 
eigenvalue problem is called ill-conditioned (resp., well-conditioned) if its condition 
number is sufficiently large (resp., sufficiently small). 

In 1965, Wilkinson [25] introduced the condition number of a simple eigenvalue Ao 
of a matrix A while discussing the sensitivity of Ao in terms of the associated right and 
left eigenspaces. Two years later, Smith |19| obtained explicit expressions for certain 
condition numbers related to the reduction of matrix A to its Jordan canonical form. 
In early 1970's, Stewart [20] and Wilkinson [26] used the condition number of the 
simple eigenvalue Ao to construct a lower bound and an upper bound for the distance 
from A to the set of matrices that have Ao as a multiple eigenvalue, respectively. 
Recently, the notion of the condition number of simple eigenvalues of matrices has 
been extended to multiple eigenvalues of matrices [H [10], [11] and to eigenvalues of 
matrix polynomials [111 | 
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In this article, we are concerned with conditioning for the eigenvalue problem of 
a matrix polynomial P(X) with a nonsingular leading coefficient, generalizing known 
results of matrix perturbation theory [1J [TJ [TO] [19], I26j . In the next section, we give 
the definitions and the necessary background on matrix polynomials. In Section [3l we 
investigate the strong connection between the condition numbers of the eigenvalues of 
-P(A) and the growth rate of its pseudospectra. This connection allows us to portrait 
the abstraction of the condition numbers of eigenvalues. In Section 01 we examine 
the relation between the condition number of a simple eigenvalue Ao of -P(A) and 
the distance from -P(A) to the set of matrix polynomials that have Ao as a multiple 
eigenvalue. In particular, we see that if the condition number of Ao is sufficiently 
large, then this eigenvalue is close to be multiple. In Section El we provide a new 
expression for the condition number of a simple eigenvalue Ao of P(A), which involves 
the distances from Ao to the rest of the eigenvalues of P (A). Finally, in Section [6] we 
present an extension of the Eisner Theorem [71I2HI22] to matrix polynomials. Simple 
numerical examples are also given to illustrate our results. 

2 Preliminaries on matrix polynomials 

Consider an n x n matrix polynomial 

P{\) = A m X m + A m _ 1 A m - 1 + • • • + A X X + A , (1) 

where A is a complex variable and Aj £ £, nxn (j = 0, 1, . . . , m) with det A m ^ 0. The 
study of matrix polynomials has a long history, especially with regard to their spectral 
analysis, which leads to the solutions of higher order linear systems of differential 
equations. The suggested references on matrix polynomials are [9~1 [13] [15] . 

A scalar Ao £ C is called an eigenvalue of P{\) if the system P(Xq)x = has 
a nonzero solution xo £ C n , known as a right eigenvector of P(X) corresponding to 
Ao- A nonzero vector yo £ C n that satisfies VqP(\o) = is called a left eigenvector of 
P(A) corresponding to Ao- The set of all eigenvalues of -P(A) is the spectrum of -P(A), 
cr(-P) = {A £ C : detP(A) = 0} , and since det^4 m ^ 0, it contains no more than nm 
distinct (finite) elements. The algebraic multiplicity of an eigenvalue Ao £ o~{P) is 
the multiplicity of Ao as a zero of the (scalar) polynomial detP(A), and it is always 
greater than or equal to the geometric multiplicity of Ao, that is, the dimension of the 
null space of matrix P(Ao). 

2.1 Jordan structure and condition number of the eigenproblem 

Let Ai, A2, . . . , A r £ a(P) be the eigenvalues of P(X), where each Aj appears exactly 
ki times if and only if its geometric multiplicity is ki (i = 1, 2, . . . , r). Suppose also 
that for an eigenvalue Aj £ o~(P), there exist 2^,1,0^2, • • • ,Xi >s . £ C n with Xt,i 7^ 0, 
such that 

« 1 

p0_1) ( A *) = ; £ = i> 2, . . . , a f> 

3=1 U '' 

where the indices denote the derivatives of P{X) and Sj cannot exceed the algebraic 
multiplicity of Aj. Then the vector 2^1 is clearly an eigenvector of Aj, and the vectors 
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are known as generalized eigenvectors. The set 1, 2^,2, 



} 



is called a Jordan chain of length s, of -P(A) corresponding to the eigenvalue Aj. 
Any eigenvalue of P(X) of geometric multiplicity k has maximal Jordan chains 
associated to k linearly independent eigenvectors, with total number of eigenvectors 
and generalized eigenvectors equal to the algebraic multiplicity of this eigenvalue. 

We consider now the n x nm matrix X = [xi t i • • • xi, si X2,i • • • x r ,i • • • %r,s r ] 
formed by maximal Jordan chains of P(X), and the associated nm x nm Jordan matrix 
J = J\ @ J2 © • • • © Jr , where each J, is the Jordan block that corresponds to the 

X 



Jordan chain {a^i, 3^,2, 



.} of Aj. Then the nm x nm matrix Q 



XJ 



XT' 



is invertible [9], and we can define Y = Q 





A- 1 



L m 



The set (X, J, Y) is called a 



Jordan triple of P(A), and satisfies P(A) _1 = X(IX— J)~ 1 Y for every scalar A ^ cr(P) 
[9]. Motivated by the latter equality and [5j, we define the condition number of the 
eigenproble of P(X) as k(P) = \\X\\ \\Y\\ , where || • || denotes the spectral matrix 
norm, i.e., that norm subordinate to the euclidean vector norm. 



2.2 Companion matrix 

The (block) companion matrix of P(X) is the nm x nm matrix 





C P 








I 





~A-mA 



m—l 



It is straightforward to verify that 

E{X){XI - Cp)F(X) 



P(A) 




where -F(A) 



I 

XI 







and E(X) 





'm(n— 1) 



Ei(X) E 2 (X) 
-I 



(2) 



E m (X) 




_ A m 7 A" 1 " 1 / ••• / J [ -10 

with E m {X) = A m and E r (X) = A r + A-E r +i(A) for r = m — 1, m — 2, . . . , 1. It is 
also easy to see that detF(A) = 1 and detP(A) = ±det A m (7^ 0). As a consequence, 
o~(P) coincides with the spectrum of matrix Cp, counting algebraic multiplicities. 



1 Note that the definition of the condition number k(P) depends on the choice of the triple (X, J, Y), 
but for simplicity, the Jordan triple will not appear explicitly in the notation. 
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2.3 Weighted perturbations and pseudo-spectrum 

We are interested in perturbations of -P(A) of the form 

m 

Q(X) = P(A) + A(A) = ^(,1, • \)X J . 

3=0 



(3) 



where the matrices Ao,Ai,...,A m G C nxn are arbitrary. For a given parameter 
e > and a given set of nonnegative weights w = {wo, w±, . . . , w m } with u>o > 0, we 
define the class of admissible perturbed matrix polynomials 

B{P,e,w) = {Q(A) as in © : ||A^|| < ewj, j = 0,1,. ..,m} 

(recall that || • || denotes the spectral matrix norm). The weights wq, w\, . . . , w m allow 
freedom in how perturbations are measured, and the set B(P,e,w) is convex and 
compact with respect to the max norm ||P(A)||oo = max \\Aj\\ [3J. 

0<j<m 

A recently popularized tool for gaining insight into the sensitivity of eigenvalues 
to perturbations is pseudospectrum; see [3j EJ [I2j [24] and the references therein. The 
e-pseudo spectrum of P(\) (introduced in [24J) is defined by 

a E (P) = {fie a(Q) : Q(A) G B(P,e,w)} = { M G C : s min (P(fi)) < £«;(H)} , 



where s„ 



w m _iA 



m— 1 



•) denotes the minimum singular value of a matrix and u>(A) = w m A m + 
+ • • • + wiA + too- The pseudospectrum cr £ (P) is bounded if and only if 
ew m < s m i n (A m ) |12j . and it has no more connected components than the number of 
distinct eigenvalues of P(X) [3J. 



2.4 Condition number of a simple eigenvalue 

Let Ao G <j(P) be a simple eigenvalue of P(X) with corresponding right eigenvector 
xo G C n and left eigenvector yo G C n (where both xo and yo are unique up to 
scalar multiplications). A normwise condition number of the eigenvalue Ao, originally 
introduced and studied in [23] (in a slightly different form), is defined by 



k(P,X c 



lim sup 



\sx \ 



: det Q(A + <5A ) = 0, Q(X) G B{P, e, w) 



w (\^o\) Ikojj IboJ 
\y*oP'(^)xo\ 



(4) 
(5) 



Since Ao is also a simple eigenvalue of the companion matrix Cp, we define the 
condition number of Xq with respect to Cp as 



k{C P , A c 



IXol 



IV'oXoI 



(6) 



(see [HI [El [261 [27]), where 



Xo 



xo 
Aqxq 



and Vo 



^i(A )*yo 
^2(A )*yo 

-^m(Ao)*yo 



(7) 
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are associated right and left eigenvectors of Cp for the eigenvalue Ao, respectively. 
By straightforward computations, we can see that ^qXo = yoP'(Xo)xo. This relation 
and the definitions © and © yield the following [1 

w(\\ 



k(P,Xo) 



\Xo\ 



k(C P ,X ). 



(8) 



2.5 Condition number of a multiple eigenvalue 

Suppose that Ao £ cr(P) is a multiple eigenvalue of -P(A), and that po is the maximum 
length of Jordan chains corresponding to Ao- Then we can construct a Jordan triple 
ofP(A), 

/ T y*i. P0 1 \ 



(X,J,Y) 



[xi,i ■ • • x 1>P0 x 2 ,i ■■ ■] , J\ @ J 2 



\ 



(9) 



where J%, J 2 , ■ ■ ■ , J K0 are the poxpo Jordan blocks of Ao, and J contains all the Jordan 
blocks of Ao of order less than po and all the Jordan blocks that correspond to the 
rest of the eigenvalues of -P(A). Moreover, xi ) \,X2 t i, ■ ■ ■ ,x KQt i are right eigenvectors 
of -P(A) that correspond to J\, J 2 , ■ ■ ■ , J K0 , and yx ; i, 2/2,1, • • • ; Uk ,i are the associated 
left eigenvectors. Following the approach of [31 UTil HTJ [16] on multiple eigenvalues, we 



consider the matrices X 



x 2 ,i ■ ■ ■ X Kf)} l 



e C nXK0 and Y 



Via 

2/2,1 

- 2/k ,i 



and define the condition number of the multiple eigenvalue Ao by 

k(P,\ ) = w(\X \)\\XY\\. (10) 

Notice that since the matrices X and Y are of rank kq < n, the product X Y is 
nonzero and k(P, Ao) > (keeping in mind that vjq > 0). Moreover, if the eigenvalue 
Aq is simple, i.e., po = kq = 1, then the definitions © and ([TO]) coincide [TT] . 



3 Condition numbers of eigenvalues and pseudospectral 
growth 

Consider a matrix polynomial P(X) as in ([1]). Since the leading coefficient of P(X) 
is nonsingular, for sufficiently small e, the pseudospectrum cr £ (P) consists of no 
more than nm bounded connected components, each one containing a single (possibly 
multiple) eigenvalue of P(X). By the definition @ and the proof of Theorem 5 of 
[23j, it follows that any small connected component of o~ £ {P) that contains exactly 
one simple eigenvalue Ao £ o~(P) is approximately a disc centered at Ao- Recall that 
the Hausdorff distance between two sets 5,TcC is 



7i(S, T) = max < sup inf Is — t\, sup inf Is — t 
l, 6 5*eT 1 terms' 
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Proposition 1. If Ao G o-(P) is a simple eigenvalue of P(X), then as e — > 0, the 
Hausdorff distance between the connected component of cr £ (P) that contains Xq and 
the disc {/i G C : \fi — Ao| < k(P, Ao) e} is 

Next we extend this proposition to multiple eigenvalues of the matrix polynomial 
P(A), generalizing a technique of [10] for matrices (see also [1]). 

Theorem 2. Suppose that Ao is a multiple eigenvalue of P(X) and po is the dimen- 
sion of the maximum Jordan blocks of Ao- Then as e —* 0, £/ie Hausdorff distance 
between the connected component of pseudospectrum cr e (P) that contains Ao and the 

disc {juGC: A | < (A:(P,Ao)e) 1/po } is o^ 1 ^ ). 

Proof. Consider the Jordan triple (X, J, Y) of -P(A) in ([9]) and the condition number 
k(P, Ao) in ([10]) , For sufficiently small e > 0, the pseudospectrum a £ {P) has a 
compact connected component <5 e such that £/ £ n cr(P) = {Ao}- In particular, the 
eigenvalue Ao lies in the (nonempty) interior of Q e ; see Corollary 3 and Lemma 8 of 
[3]. Let also \x be a boundary point of Q £ . Then it holds that 

W^V)) = ew(\n\) and P{n)~ x = X{Ip- J)- X Y. 

Denote now by iV the po x pq nilpotent matrix having ones on the super diagonal 
and zeros elsewhere, and observe that 



iV P0 = and (IX-N) 



-i 



A- 1 A- 2 
A- 1 



X~po 
X-po+i 



Po-l 

A- 1 ^(A- 1 iVy (A/0). 

3=0 



As in [U [10] , we verify that 



Ia*-A |p° 

5min(-P(M)) 



l/i-Aonip- 1 ^)!! = |/i-A n|x(//i- j)- x y|i 



|/i - A ( 



\Po 



Xdiag {(//i - Jx)- 1 , . . . , (Ifi - J K0 )-\ (i> - J)- 1 } Y 



(p- A ) P0 ~ 1 Xdiag< 



Po-l 



^((/,-Ao)" 1 ^, 



3=0 



po-l 



. . . , (( M - Ao)- 1 ^', (ja - A )(J/i - J)- 1 [ y 

3=0 J 

For each one of the first kq diagonal blocks, we have 

po-l 

(/x - Aor" 1 (0* - Ao)- 1 ^^ = iV 190 - 1 + 0(/x - Ao). 

3=0 
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Thus, it follows 

Smin(-P(/-0) 



\X diag {N^- 1 + 0(/i - A ), . . . , N^- 1 + Ofji - A ), 0((/x - A ) w> )} ^ | 
IXdiagjATW'- 1 , . . . , TV™" 1 , 0} y|| + 0(\n - A |) 



[0 ••• xi,i ■■■ x 2) i •••] 



+ Ao|), 



where the right eigenvectors 2:1,1, £2,1, • • • , ^k ,i an d the rows l5 y\ 1 , . . . , y* K0 1 lie 
at positions po, 2po, . . . , kqPOj respectively. As a consequence, 



or 



or 



Smin(-P(/U)) 

e«;(|mI) ll^^ll 
\n - Ao 



= ||xy|| + o(|/i-A |), 

1 + 0(|m-Ao|), 
= l + r e , 



(^(^Ao)^ 1 ^ 
where r £ el goes to as e — ► 0. This means that 

IM-Aol = (fe(P,A )e) 1/po + o(£ 1/po ). 

Since \x lies on the boundary dQ £ , it is easy to see that the Hausdorff distance between 
g £ and the disc {/ieC: |/x - A | < (fc(i> A ) e) 1/po } is o(e 1 / po ). □ 

The above two results indicate how the condition number of an eigenvalue of 
.P(A) quantifies the sensitivity of this eigenvalue. Consider, for example, the matrix 
polynomial 

" (A - l) 2 A - 1 A - 1 
(A-l) 2 



P(A) 







A 2 



1 A 2 



1 



with det(P(A)) = (A - 1) 5 (A + 1) and a(P) = {1, -1}. The eigenvalue A = 1 has 
algebraic multiplicity 5 and geometric multiplicity 3, and the eigenvalue A = — 1 is 
simple. A Jordan triple of P(X) is given by 



X = 



1 1 
1 1 10 
0-1102 



J 



1 


1 
















' 





0.25 





1 
















1 





-0.5 








1 


1 













1 


-0.5 

























1 











1 

















1 







-1 


-1 


1 

















-1 










-0.25 



7 



The matrices of the eigenvectors that correspond to the maximum Jordan blocks of 

. Thus, for the weights 
Wo = Wl = W2 = 1, we have k(P, 1) = w(l) \\X Y\\ = 4.2426. 





' 1 


" 




eigenvalue A = 1 are X = 





1 


and Y = 




_ 


-1 





1 
1 



-0.5 





-0.2 0.2 0.4 



0.9 0.92 0.9. 



0.98 1 1.02 1.04 1.06 



Figure 1: The boundaries da e (P) for e = 1(T 4 , 2 • 1(T 4 , 4 • 1(T 4 , 8 • 10" 



The boundaries of the pseudospectra cr £ (P), e = 10~ 4 , 2- 10~ 4 , 4- 10~ 4 , 8-10 -4 , are 
illustrated in the left part of Figured! The eigenvalues of P(X) are marked with +'s 
and the components of the simple eigenvalue A = —1 are not visible. The components 
of the multiple eigenvalue A = 1 are magnified in the right part of the figure, and they 
are very close to circular discs centered at A = 1 of radii (k(P, 1) 1CF 4 ) 1 / 2 = 0.0206, 
(k(P, 1] )2 • 10" 4 ) 1 / 2 = 0.0291, (k(P, 1) 4 • 10" 4 ) 1 / 2 = 0.0412 and (k(P, 1) 8 • lO" 4 ) 1 ^ = 
0.0583, confirming Theorem [2j 



4 Distance from a given simple eigenvalue to multiplicity 

Let P(X) be a matrix polynomial as in ([T]), and let Ao be a simple eigenvalue of P(X). 
In the sequel, we generalize a methodology of Wilkinson [26] in order to obtain a 
relation between the condition number k(P,Xo) and the distance from P(X) to the 
matrix polynomials that have Ao as a multiple eigenvalue, namely, 

dist(P, Ao) = inf {e > : 3 Q(X) £ B(P, e, w) with Ao as a multiple eigenvalue} . 

The next proposition is a known result (see [U Theorem 3.2] and [U Proposition 
16]). Here, we give a new proof, which motivates the proof of the main result of this 
section (Theorem [5]) and is necessary for the remainder. 

Proposition 3. Let P(X) be a matrix polynomial as in Ao £ a(P)\a(P') and 
yo,XQ £ C n be corresponding left and right unit eigenvectors, respectively. If t/qP'(Ao)xo 
= 0, then Aq is a multiple eigenvalue of P(X). 
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Proof. By Schur's triangularization, and without loss of generality, we may assume 
that the matrix P(Aq) has the following form, 



^(Ao) 



b* 
B 



be 



in— 1 



, B G 



(n— l)x(n— 1) 



Moreover, since P(Xq)xq = 0, we can set xq = e\ = [1 • • • 0] T '. Then we have that 



yoP'(A )ei = 0, and hence, yoP'(A ) 

Since A $ cr(P') and 2/qP(A ) = 0, it follows 

y *P'(Ao){[P / (Ao)]- 1 P(A )} 



[0 io*] for some 0/w£ C n-1 . 
0, 



or equivalently, 



or equivalently, 



y* P'(X ) [^'(Ao)]- 1 



b* 
B 



0. 



[0 w*] 



a* 
4 



0. 



where a G C n 1 and j4 G C^ n x ) x ( n x ). As a consequence, w*A = and the 
matrix A has as an eigenvalue. Thus, is a multiple eigenvalue of the matrix 
[P / (Ao)] _1 P(A ). We consider two cases: 

(i) If the geometric multiplicity of G <t([P / (Ao)] _1 -P(Ao)) is greater than or equal to 
2, then rank(P(Ao)) < n — 2, and hence, Ao is a multiple eigenvalue of P(X). 

(ii) Suppose that the geometric multiplicity of the eigenvalue G o"([P'(Ao)] _1 -P(Ao)) 
is equal to 1 and its algebraic multiplicity is greater than or equal to 2. Then, keeping 
in mind that [P / (Ao)] _1 -P(Ao)ei = 0, we verify that there exists a vector z\ G C n such 
that [P'(X )]~ 1 P(X Q )z 1 = ex, or equivalently, P(X )(-z 1 ) + P'(A )ei = 0. Thus, A 
is a multiple eigenvalue of P(A) with a Jordan chain of length at least 2. □ 

Recall that the condition number of an invertible matrix A is defined by c{A) = 
\\A\\ ||j4 _1 || and it is always greater than or equal to 1. 

Theorem 4. Let P(A) be a matrix polynomial as in |7]j, Ao G a(P)\a(P') be a simple 
eigenvalue of P(X), and yo,XQ G C n be corresponding left and right unit eigenvectors, 
respectively. If the vector [P'(Ao)]*yo is n °t a scalar multiple of xq, then 



dist(P,A ) < 



c(P'(A )) ||P(A ) 



k(P,X )(\\y*P'(X )f 



;P'(Ao)x | 



1/2" 



Proof. As in the proof of the previous proposition, without loss of generality, we may 
assume that 



^(Ao) 



b* 
B 



b€ 



m— 1 



B G C'™ -1 - 1 ^™ -1 ) 



and xq = e%. If we denote 5 = yoP'(Ao)xo = Ho?' (^o)ei 0> then it is clear that 

y *P'(A ) = [5w*}, 
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for some w G C n . Furthermore, because |<5| < \\iiqP'(Xo) 

Since A ^ cr(P') and yoP(A ) = 0, it follows 



^ J P / (Ao){[P , (A r 1 P(A )} = 0, 



or equivalently, 



or equivalently, 



y *P'(A ) [P'(Ao)] 



i-i 



b* 
B 



0, 



[5 w* 



a* 
,4 







for some a G C n_1 and A G C^-lM™-!), If a = 0, then w* A = 0, and the proof of 
Proposition [3] implies that Ao is a multiple eigenvalue of P(A); this is a contradiction. 
As a consequence, a / 0. Moreover, 



w*A + 5a* = 0, 



and hence, 



w* A + 



-wa 



WW 



0. 








This means that if we consider the (perturbation) matrix E 
the matrix 

[P'(Ao)]- 1 P(A ) + P = [P'(Ao)]- 1 [P(Ao)+P'(A )P] 

has as a multiple eigenvalue. 
We define the n x n matrices 

A = P'(X )E and Q = P(A ) + A, 

and the matrix polynomial A(A) = Xw=o A?^ W1 ^h coefficients 



-wa 



then 



A; 



3 = 0, 1, 



VlAoiy u,(|Ao|) 

where (by convention) we assume that Ao/Ao = whenever Ao = 0. Then, denoting 

= "(B t^T' one can verif y that 

A(A ) = A and A'(A ) = </>A. 
We define also the matrix polynomial Q(X) = P(A) + A(A), and consider two cases: 

(i) Suppose that the geometric multiplicity of G aQP^Ao)] Q) is greater than or 
equal to 2. Then rank(Q) = rank(Q(Ao)) < n — 2, or equivalently, Ao is a multiple 
eigenvalue of the matrix polynomial Q(X) of geometric multiplicity at least 2. 

(ii) Suppose now that the geometric multiplicity of the eigenvalue G cr([P'(Ao)]~v) 
is equal to 1, and its algebraic multiplicity is greater than or equal to 2. Then, keeping 
in mind that Qe\ = 0, there is a vector z\ G C n such that 

[P'(X )]- 1 Qz 1 = e x , 
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or equivalently, 

Q(-z 1 ) + P'(X )e 1 = 0. (11) 

We observe that A'(Ao)ei = (pAei = 4>P' {Xo)Ee\ =0. As a consequence, ([TT]) is 
written in the form 

Q(Ao)(-zi) + Q'(A )ei = 0. 

Thus, Ao is a multiple eigenvalue of Q(A) with a Jordan chain of length at least 2. 

In both cases above, we have proved that Ao is a multiple eigenvalue of Q(X). 
Furthermore, we see that 



\E\ 












-wa 



< 



< 



K 

\\w\\ 

R 

\\w\\ 



WW 

' a 
A 

|[P'(A )] 





8 * 




wa 




w*w 



w 



-1 



1 1 If I 
l|P(A )|| 



|[P'(Ao)]- 1 P(Ac 



As a consequence, for every j = 0, 1, 



,m, 



I A,- 



Wj 



< 



< 



™(|Ao|) 

Wj 

MM 



W j 



A 



™(\\o\) 

^'(Ao)|| ll^ll 



|P'(Ao)£| 



1*1 



w(|A |) 1 1 to 

c(P'(A )) ||P(A 



[ J P'(Ao)]- 1 ||||P'(Ao)||||P(A c 



fc(P,A ) ||y *P'(Ao)|| 2 -^ 



1/2 ' 



and the proof is complete. 

The spectrum of the matrix polynomial 



□ 



P(A) 



IX + 



1 2 
0.25 
-0.5 



A + 




25 




15.25 



is o-(P) = {0, -1, 0.25 ±i 3.8971, ±i 5}. For the weights w 2 = \\A 2 \\ = 1, w\ = \\Ai\\ = 
2.2919 and w = \\A \\ = 25.0379, the above theorem implies dist(P, -1) < 0.4991. 
If we estimate the same distance using the method proposed in |17j . then we see 
that dist(P, -1) < 0.5991. On the other hand, for the eigenvalue 0.25 - i 3.8971, 
Theorem H yields dist(P,0.25 - i 3.8971) < 0.1485, and the method of [17] implies 
dist(P, 0.25 — i 3.8971) < 0.1398. At this point, it is necessary to remark that the 
methodology of [17J is applicable to every complex number and not only to simple 
eigenvalues of P(A). 
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5 An expression of k(P, Ao) without eigenvectors 



In this section, we derive a new expression of the condition number k(P, Ao) that 
involves the distances from Ao G °~(P) to the rest of the eigenvalues of the matrix 
polynomial P (A), instead of the left and right eigenvectors of Ao- The next three lem- 
mas are necessary for our discussion. The first lemma is part of the proof of Theorem 
2 in |19j . the second lemma follows readily from the singular value decomposition, 
and the third lemma is part of Theorem 4 in [T3] . 

Lemma 5. For any matrices C,R,W G C nxn , R &d](WCR) W = det(WR) adj(C). 

Lemma 6. Let A be an n x n matrix with as a simple eigenvalue, s\ > S2 > ■ ■ ■ > 
s n _i > s n = be the singular values of A, and u n ,v n G C n be left and right singular 
vectors of s n = 0, respectively. Then u n and v n are also left and right eigenvectors of 
A corresponding to 0, respectively. 

Lemma 7. Let A be an n x n matrix with as a simple eigenvalue. If s\ > S2 > 
■ • • > s n -i > s n = are i/ie singular values of A, then ||adj(j4)|| = S1S2 ■ ■ ■ s n -i- 

The following theorem is a direct generalization of Theorem 2 of [19J. 

Theorem 8. Let P(A) 6e a matrix polynomial as in {2|j w;ii/i spectrum o~(P) = 
{Ai, A2, • • • , A nm }, counting algebraic multiplicities. If Aj zs a simple eigenvalue, then 



k(P,Xi) 



w(\\i\) ||adj(P(A i ) 



det A r 



A,; 



Proof. For the simple eigenvalue Aj G o~(P), consider a singular value decomposition 
of matrix -P(Aj), 



P(Aj) = f/Sr = C/diag{ Sl ,...,s n _i,0}y*. 



Then we have 



U* 






n(m— 1) 



and Lemma [5] implies 



P(A.) 






'n(m— 1) 



V 






n(m~l) 



s 









V 








adj 



'n(m—l) 

= det(C/*F) adj 



S 






In(m—1) 

P(Ao) 










^n(m— 1) 



I, 





n(m— 1) 



(12) 



where |det(E7*V)| = 1. 

Let u n ,v n G C n be the last columns of f7 and V, respectively, i.e., they are left and 
right singular vectors of the zero singular value of P(Aj). Then by Lemma[6l y, = u n 
and Xi = v n are left and right unit eigenvectors of Aj G <r(P), respectively. Let also 
V>?, and Xi be the associated left and right eigenvectors of Cp for the eigenvalue Aj 
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given by ©. Then by ©, [19, Theorem 2], Lemma © and CE]) (applied in this 
specific order), it follows 



k(P, Xi 



HIM) 



k(C P ,\i 



w(\Xi\) || adj (Xil -C P ) || 



n^iAi-A*i 

«;(|Ai|) ||F(Ai) adj( J B(A i )(A i /-Cp)F(A l ))S(A,)|| 



|det(if(Ai)^(Ai))| n^lA^-A 
P(Ai) 



F(Ai) adj 



J, 



n(m— 1) 



£(A;) 



| det( J P(A i )^(A i ))| IA,- - A, 



MM) 



v o 



I, 



n(m— 1) 



adj 







7, 



n(m— 1) 



17* 



7 



n{m— 1) 



#(Ai) 



Thus, 



where 



G = F(Ai) 
Moreover, 



7 



|detAn| II, ,, Aj A, 
||G|| 



det^ m | n^JAj -Ai|' 



(13) 



7, 



n(m— 1) 



adj 



7 





n(m— 1) 



17* 



7, 



n(m—l) 



E(Xi). 



adj 



S 

7n( m -l) 



5 







n(m— 1) 



where S 1 = S1S2 • • • s n _idiag{0, . . . , 0, 1}. As a consequence, the matrix G is written 



G = F(Xi) 



V 

I n (m-1) 

VSU* 



S 







n(m— 1) 



?7* 

In(rn-l) 



E(Xi) 







n(m— 1) 



7SE/* ••• 
XiVSU* ••• 



E(Xi) 



E(Xi) 



XT^VSU* 







sis 2 • • • s n _i 



AjU n ti* 



A™-^ n < 



S(Ai) 



13 



sis 2 ■ ■ • S n _l 



XiVi 



S1S2 ■ ■ ■ Sn-1 



XiXiylE^Xi) 



x iy *E m (\i) 
XiXiy*E m (\i 

i m— 1, 



L \™- L x i y*E l {\ i ) ••• X™- L x t y*E m (Xi 



sis 2 ■ ■ • S n _l 



X i 

XjXi 



XT^Xi 



E 2 {h)*Vi 
E m {\i)*yi 



= sis 2 - • • s n _i(xi^*)- 
Hence, by (|13p and Lemma [71 it follows 

w(|Ai|) (sis 2 - • -Sn-i) WXiip* 



k(P,Xi 



Since \\xii>* 



w{\\i\) ||adj(P(Ai 



|Xi|| HV'ill I det An I n^yi \^j ~ M \detA m \ Hj^lXj - \i\ \\xi 
II = llXill llV'ill ) the proof is complete. 



□ 



The next corollary follows readily 



Corollary 9. Let P(\) be a matrix polynomial as in (OP with spectrum cr(P) = 
{Ai, A 2 , • • • , A nm } ; counting algebraic multiplicities. If A, is a simple eigenvalue of 
-P(A) with yi,Xi 6 C n associated left and right unit eigenvectors, respectively, then 



min I Xj — A,; I < 



w{\Xi\) [|adj(P(Ai 



K*P'(A t )x,| ||adj(P(A, 
I det AJ 



fc(P,A;)|detA m | 

Moreover, if the vector [P'(Ao)]*yo is not a scalar multiple of xq, then 

c(P'(Xi)) \\P(Xi)\\ I det A 



dist(P,Ai) < 



w(\Xi\) ||adj(P(Ai 



|j/?P'(Ai)xi| 2 



1/2 IIl A i _Ai l- 



It is remarkable that for the simple eigenvalue Aj € c(P), Theorem [8] and the 
definition ([5]) yield 



det An | Ylj^i l<\? ~ 
l|adj(P(A 4 ))|| 



^ 



X; 



1) 



Thus, Proposition [3] follows as a corollary of Theorem [HJ and the size of the an- 
gle between the vectors [P'(Xi)]*yi and Xi is partially expressed in algebraic terms 
such as determinants and eigenvalues. Note also that Aj is relatively close to some 
other eigenvalues of P(A) if and only if k(P, Xi) is sufficiently greater than the quan- 
tity w(|Aj|) ||adj(P(Aj))|| |det An| _1 - Furthermore, the condition number k(P, Xi) is 
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relatively large (and Aj is an ill-conditioned eigenvalue) if and only if the product 
Ylj^i |Aj — Aj| is sufficiently less than tu(|Ai|) ||adj(P(Aj))|| Idet^ml^ 1 . 

To illustrate numerically the latter remark, consider the matrix polynomial 



P(A) 



0.001 
1 



A 2 + 



-0.003 
-7 



A + 



0.002 0.001 
12 



with (well separated) simple eigenvalues 1, 2, 3 and 4, and set Wq = Wi = u>2 = 1- 
Then it is straightforward to see that for the eigenvalues A = 1 and A = 2, 

w(l) ||adj(P(l))|| 

k(P, 1) ^ 3000, |2 - 1| |3 - 1| |4 - 1| = 6 and v ; " JV \ JJU 18000, 

det Am. 



and 



k(P,2) ^ 7000, |1 - 2| |3 - 2| |4 - 2| = 2 and W<y2 \ \^^E!fM 14000. 

det A m 



On the other hand, for the eigenvalue A = 4, we have 

k(P, 4) ^ 21.2897, |1-4||2-4||3-4| =6 and !gWHjCg(f))j! ^ 127.738. 

ciet A m 




Figure 2: The boundaries da £ {P) for e = 5 • 10~ 5 , 10~ 4 , 2 • 10~ 4 . 



The left part of Figure [2] indicates the boundaries of the pseudospectra <7 e (P) for 
e = 5 • 10~ 5 , 10" 4 , 2 • 10~ 4 . The eigenvalues of P(X) are marked with +'s. The small 
components of a 5 .i -5(P), a w -i(P) and o" 2 . 10 -4(P) that correspond to the eigenvalue 
A = 4 are not visible in the left part of the figure, and they are magnified in the right 
part. Note that these components almost coincide with circular discs centered at 
A = 4 of radii k(P, 4) e, e = 5 • 10~ 5 , 10~ 4 ,2 • 10 -4 , as expected from Proposition [TJ 
It is also apparently confirmed that the eigenvalue A = 2 is more sensitive than the 
eigenvalue A = 1 (more particularly, one may say that the eigenvalue A = 2 is more 
than twice as sensitive as A = 1), and that both of them are much more sensitive 
than the eigenvalue A = 4. 
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6 An Eisner-like bound 



In this section, we apply the Eisner technique [7] (see also [21]) to obtain a perturba- 
tion result for matrix polynomials. This technique allows large perturbations, yielding 
error bounds, and it does not distinguish between ill-conditioned and well-conditioned 
eigenvalues. 

Theorem 10. Consider a matrix polynomial P(A) as in and a perturbation 
Q{\) G B(P,e,w) as in (0). For any fi G a(Q)\a(P), it holds that 



min \n — A| < 

xea(P) 



ew(\fi\) \ mn 



det A r 



1^)11 



l— 



Proof. Let a(P) = {\ u A 2 , 
H G a(Q)\a{P). Then 



Anm} 5 counting algebraic multiplicities, and suppose 
|detP(/i)| 



min \u — \\ nm < \\\u — \i 
A GCT (P) l\ 



det A r . 



Let now U = [u\ U2 • • • u n ] beannxn unitary matrix such that Q(fi)u\ = 0. By 
Hadamard's inequality [21] (see also |22l Theorem 2.4]), it follows 



mm u — A 

Aecr(P) 1 



< 



< 



< 



< 



detP(//)| |deti>(/z)| |detC/| 



det A n 
1 



det A- 



detA m | 
1 



det A m | 
1 



det >l m | 
1 



det A m | 



det A r , 



»=i 

r 

np^Kiinii^ 

i=2 

[ip(^)«i-q(m)«iii nn p (^ 

||A( M H||||P0u)|r 

np^ir- 1 - 1 



i=2 
mm— 1 



and the proof is complete. 



□ 



Recently, the classical Bauer-Fike Theorem [2] has been generalized to the case of 
matrix polynomials [5J. Applying the arguments of the proof of Theorem 4.1 in [5], 
it is easy to verify the "weighted version" of the result. 

Theorem 11. Consider a matrix polynomial P(A) as in {!]) and a perturbation 
Q(A) G B(P, e, w) as m (0), and Zet (X, J, Y") be a Jordan triple of P(A). For any G 
a(Q)\o~(P), it holds that min |/i — A| < max i? 1 ^) , where $ = p k(P) ew(\/j,\) 

Xecr(P) 

and p is the maximum dimension of the Jordan blocks of J. 
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To compare these two bounds, we consider the matrix polynomial 



P(A) 



/A 3 + 



V2 
y/2 



A 2 + 



-1 

1 



(see [5J Example 1] and [9] Example 1.5]) with detP(A) 
Jordan triple (X, J, Y) of -P(A) is given by 



X 



1 





V2 + 1 
1 



y/2-2 V2 + 1 V2 + 2 
1 



and Y 



T 



-4 v/2 + 2 




V5- 
1 



J 



-V2 + 2 




A 2 (A + 1 


) 2 (A 


- r 


2 . 


























1 


1 











1 














-1 


1 











-1 



V2 + 1 
-1 



The associated condition number of the eigenproblem of -P(A) is k(P) = 6.4183. For 
e = 0.3 and w = {wq, wi,W2, w^} = {0.1, 1, 1, 0}, the matrix polynomial 



Q(A) 



/A 3 + 



i0.3 

V2 



V2 
-i 0.3 



A 2 + 





0.7 



-0.7 




A + 



0.01 
0.03 



lies on the boundary of £> (P, 0.3, w) and has fi = 0.5691 + i 0.0043 as an eigenvalue. 
Then min \u — A| = 10.5691 + i 0.0043 — 1| = 0.4309, the upper bound of Theorem 

[10] is 0.8554, and the upper bound of Theorem [TT1 is 3.8240. 

It is clear that the Elsner-like upper bound is tighter than the upper bound of 
Theorem [11] when [|P(/i)[| is sufficiently small; this is the case in the above example, 
where ||P(0.5691 + i 0.0043) || = 1.0562. In particular, if we define the quantity 



n(p,E,M) 



|det A m | 
det A m \ 



{p k{P)) mn {ew^^))™- 1 , when p k(P) ew(\fi\) > 1 

ran ran -j^ 

(p k(P)) p (ew(\fj,\)) p , when p k(P) ew(\fi\) < 1 



then it is straightforward to see that the bound of Theorem [10] is better than the 
bound of Theorem [TT] if and only if ||P(//)[| < £l(P, e, /u) . 
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